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Abstract. We consider the invariance principle without the classi- 
cal condition of asymptotic negligibility of individual terms. More 
precisely, let r.v.'s {£, n j} and {r\ n j} be such that 

E{$ nj } = E{T\ nj } = 0, E{gj} = = o*,, = 1, 

and the r.v.'s {v\ n j} are normal. We set 




;=i ;=i M 



Let X n (t) and Y n (t) be continuous piecewise linear (or polygonal) ran- 
dom functions with vertices at (tfot,Skn) an d {tkmZkn), respectively, 
and let P n and Q n be the respective distributions of the processes X n (t) 
andy„(f) inC[0,l]. 

The goal of the present paper is to establish necessary and sufficient 
conditions for convergence of P n — Q n to zero measure not involving 
the condition of the asymptotic negligibility of the r.v.'s {£, n j} an d 
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1 Introduction and results 
1.1 Background and Motivation 

The term "non-classical" concerns various limit theorems not involving the con- 
dition of asymptotic negligibility of the individual random variables (r.v.'s). To 
our knowledge, the convergence of the distributions of sums of r.v.'s to the nor- 
mal distribution in the general situation, that is, without the condition mentioned, 
was first considered by P. Levy and M. Loeve flU Chapter VIII, Section 28]. 
A developed theory with necessary and sufficient conditions was built by V.M. 
Zolotarev and his followers, V.M. Kruglov and Yu.Yu. Machis; see, e.g., [fT8l , [|6l , 
IfTOl , the monograph |fT9l , the review part in |fT3l , and references therein. Note 
also that V.M. Kruglov considered the Hilbert space case (see [6]| and references 
in the papers mentioned above.) 

A somewhat different approach - see also comments below - that uses different 
types of conditions, was suggested in [fT2l and lfT3l . In this paper, we proceed 
mainly from the framework of [|T2l and [[T3l . 

In the case of normal convergence and finite variances, the simplest result from 
lfl2l and lfT3l may be stated as follows. 

Let {£,j n } be an array of independent r.v.'s such that E{t,j n } = 0, E{l^ n } = 
g\ < °°> an d for each n, 

£oJ, = i. (l.i.i) 

j 

Without loss of generality, we assume all Oj n ^ 0. 

Let Fj n (x) be the distribution function (d.f.) of £y„, and <t>j n (x) be the normal 
d.f. with the same zero expectation and the same variance; that is, 4> ; „(x) = 
4>(jc/Oj„), where 4>(x) is the standard normal d.f. Set S n = Y^j^jn- 

Proposition 1 (/1721/) For 

P(S n <x)^ ^(X), for all x, as n — >• °o ; (1-1.2) 
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it is necessary and sufficient that 

J \x\ • \Fj n (x) — $>j n (x)\dx —> Q, as n —> °°,for any E> 0. (1.1.3) 

j IW>£ 

(This particular result is presented also in IfTTl and IfPfll .) It is easy to show 
(see, for example, ffT4l p. 3 10]) that in the classical case where max;Gy„ — > 0, the 
Lindeberg condition implies (11.1.31) . so Lindeberg's theorem follows from Propo- 
sitionQ] On the other hand, condition (11.1.31) takes into account possible proxim- 
ity of the distributions of the r.v.'s to normal ones. In particular, if Fj n = <J> jn and 
hence P(S n < x) = <&(x), then (11.1.31 ) becomes trivial. 

It is worthwhile to note also that Proposition[T]is equivalent to Zolotarev's non- 
classical theorem from [fTBl proved much earlier. In the framework of [18], the 
summands were directly divided into two groups: those with "small" variances, 
and the rest. For the r.v.'s from the former group, Lindeberg's condition was im- 
posed, while the summands from the latter group were required to be close to the 
corresponding normal r.v.'s in Levy's metric. Such a division into two groups re- 
flects the essence of the matter: "small" summands should be in the framework of 
the classical CLT, while "large" summands should be themselves close to normals. 
On the other hand, condition (11.1.31) allows to treat the summands in a unified way. 
Another difference between the theorem from [fTBl and Proposition [JJ is that the 
latter uses an integral metric. 

In the sufficiency case, the result of Proposition [JJ was generalized to the 
case of semi-martingales in Liptser and Shiryaev's paper (HI; see also Jacod and 
Shiryaev's book H VII, 5b; VIII, 4c]. 

To generalize the result above to the case of convergence to distributions dif- 
ferent from normal, one may proceed as follows. Consider another array of inde- 
pendent r.v.'s {t| 7 „}. We assume that for each n, the numbers of terms for ^'s and 
T|'s in the arrays {£,j n } and {r\ ]n } are the same and, just for simplicity, are finite. 
Let E{r[j n } = 0, E{r[ 2 j n } = dj n , and let Gj n denote the distribution of r\j n . The 
problem is to establish conditions under which 

j j 

where product of distributions is understood in the sense of convolution, and con- 
vergence =^ itself is weak convergence (with respect of all continuous bounded 
functions). At least formally, this is a more general setup, since (11.1.41 ) does not 
presuppose the existence of limits for YljFjn and FL Gj n separately. On the other 
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hand, in the particular case when Gj n = <I> 7 „, (11.1.41) clearly coincides with (11.1.21) 
in view of (11.1.1b . 

In the general situation (11.1.41) . instead of (|1.1.3I) . we consider the condition 

J \x\- \Fj n (x) — Gj n (x) \dx — >• 0, as n — ► oo, for any £ > 0. (1.1.5) 

j \\x\>z\ 

In BH, it was shown that when Gj n are Poisson, (11.1.51) remains to be a necessary 
and sufficient condition for the fulfillment of (11.1.41) . however attempts to obtain 
a similar result in the general case failed. The situation became clear when in 
[fT5l and [fT6l it was proved that in general, relation (11.1.5b is necessary for a more 
stronger type of convergence. Namely, (11.1.5b proves to be true if and only if 

l\F jn - J} G 7 „ ^ as n ^ oo, (1.1.6) 

jeB n jeB„ 

for any sequence {B n } of subsets of the indices j. See [15] and [fT6l for detail; 
note also that in [fT6l the case of infinite variances is considered as well. 

The fact that in the normal case, (11.1.4b and (11.1.6b occur to be equivalent is 
connected with the fact that normal distributions are only possible components 
of the decomposition of the normal law. The same concerns the Poisson case, 
however in general, relations ( 11.1.4b and (11.1.6b are certainly not equivalent. 

Next, note that (11.1.6b deals with all possible partial sums, so if we manage to 
establish the validity of this relation, it is natural to continue and consider a more 
sophisticated problem, namely, the asymptotic proximity of the distributions of 
the partial-sum-processes based on the r.v.'s {^j n } and 

The main goal of this note is to point out the fact that condition (11.1.3b is 
necessary and sufficient for the validity of invariance principle in the case of 
Gaussian limiting processes in the general, that is, non-classical setup. To our 
knowledge, this fact has not been aired yet, though as we will see, in view of 
already known results, the proof turns out to be not very difficult. 

Note also that, as a matter of fact, we consider a slightly more general problem 
of proximity of the distributions of the polygonal process generated by the above 
r.v.'s £,j n and the polygonal process generated by the corresponding normal r.v.'s. 
In the classical case, when maxj Oj n — ► 0, such a result clearly corresponds to the 
classical invariance principle of Donsker-Prokhorov (B2l, ifTTI ). however without 
the condition mentioned we deal with a somewhat more complicated situation. 

We hope to consider a more general case of non-normal limiting distributions 
in the next publication. 
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1.2 Results 



As was mentioned, we assume for simplicity that for each n, the numbers of terms 
in each array, {£,j„} or {%«}, are finite. Suppose all T|'s are normal, so Gj n (x) = 
<&j n (x) = &(x/<5j n ). We again assume ( 11.1.11 ) to hold, and set 

Sn = ^^;7i) Y n = V, 1 ! jn, 

j j 
k k 
Skn = Yjcn = T| jn i 

j=i 7=1 

k 

tkn = £oJ,. (1.2.1) 

Let X n (t) and Y n (t) be continuous piecewise linear (or polygonal) random func- 
tions with vertices at (^,5^) and (tkn,Ykn), respectively. Let T n and Qn be the 
respective distributions of the processes X n (t) and Y n (t) in C = C[0, 1]. 

Theorem 2 Condition ftl.l.3\) is necessary and sufficient for 

<P„-Qn^0 (1.2.2) 

(more precisely, to zero measure) weakly. 

Below, we show that the sequences {P n } an d {Q,i} arQ relatively compact, 
and hence in our case the above convergence is equivalent to that in the Levy- 
Prokhorov's metric 71, that is, n(T n , Qn) — > 0. In general, when compactness 
does not take place, and so to speak, "parts of the distributions move to infin- 
ity", asymptotic proximity of distributions even in the one-dimensional case may 
be defined in different ways, so the very notion of proximity requires further anal- 
ysis. We consider this question separately in 0]. 

We supplement Theorem [2]by the following simple proposition. Let for each 
n, the function d~(t) = E{X% (?)}. Clearly, o n (t) is continuous on [0, 1], 

k 

GtSfkn) = Y< a jm 

and in each segment [f(jfc-i) n ,ffoi]> the function o„(t) is a quadratic function. 
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Proposition 3 The process Y n (t) converges in distribution to a Gaussian process 
Y(t) on [0, 1] such that E{Y(t)} = and E{Y 2 (t)} = o 2 (t) if and only if for each 
t G [0, 1], 

a n (t) ->o(t). 

Ifmaxj<3j n — > 0, then <3 2 {t) = t, and Y(t) is the standard Wiener process. 
In general, the segment [0, 1] may be divided into two sets, A and B, with the 
following properties. 

The set A is a union of a finite or countable number of segments, and on each 
such a segment the process Y(t) is linear. 

The set B = [0, 1]\A, and if a segment [a,b] C B, then the process Y(a + s) — 
Y(a) is the standard Wiener process for s G [0,b — a}. 



2 Proofs 

The main issue is to prove the relative compactness of the measure sequences {/P n } 
and {On} (with respect to weak convergence of distributions in C). For brevity, 
we omit sometimes the adjective "relative". 



2.1 Compactness in the normal case 

For the proof below, we need to consider a modification of the process Y n (t). For 
each n = 1,2,... , consider a partition of [0,1) into some intervals [sn_i\ n ,Sj n ) 
where j = l,...,m„ < °°, and = so„ < s\ n < .... The number of intervals may be 
infinite, points Sj n may differ from the points tj„ above. 

Let W n (t) be a continuous piecewise linear process such that W n (0) = 0, on 
each interval [sn-i\ n ,Sj n ) the trajectory of the process is linear, and each incre- 
ment W n (sj n ) — W n (sfj_i\ n ) is either equal to zero, or to a normal r.v. C,j n with 
zero mean and a variance of Sj n — sn_i\ n . We prove the relative compactness of 
the family of the distributions of W n (t). 

In accordance with a well known criterion (see, e.g., Q), it suffices to prove 

that 

(A) su Pn P{|W n (0)|>A}^0asA^oo; 

(B) there exist constants a, b, c > such that for any n and t , s G [0,1], 

E{\W n (t)-W n (s)\ a }<c\t-s\ l+b . 
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In our case, (A) is obvious. We verify (B) with a = 4, b = 1. 
Setvj n =E{(W n {sjn)-W n (s(j_q n )) 2 }. By the definition of W n , either v 2 jn = 0, 

If both points t,s G [*(jfc-i) B >*foi] f° r some > 1, and Vy n /^0, then 

£{|W„(0-^(5)in=£|(^^) 4 |<^^3vt 3 <3k-5| 2 (2.1.1) 

since in this case \t — s\ <v 2 n . On the other hand, ifvjbi = 0, then W n (t) — W n (s) = 
0, and (12.1.11) is clearly true. 

If t = Sk„ and s = s mn for some k and m > k, then the r.v. W„(t) — W n (s) is 
normal with a variance that does not exceed s mn — s^ n . Then 

£{|W, 7 (0-^W| 4 }<3(^-^) 2 = 3|?-^| 2 . 

In general, ifte [s^-i) n ^kn\ and 5 G [si m -\)m s mn] for some and m> k, then 
in view of the above bounds, 

E{\W n (t) -W n ( S )\ 4 } < E{(\W n (t)-Wn(s kn )\ + \ W n (s kn ) - W n ( S{m _ 1)n ) \ 

+\Wn{s im _ 1)n )-W n ( S )\) 4 } 

<27{£{|^(0-^(^)| 4 }+ J E{|W„(^)-^(5 (m _ 1) J| 4 } 
+ E{\W n (st {m _ l)n )-W n ( S )\ 4 }} <243\t-s\ 2 .M 

2.2 Compactness of {fp„} 

First, note that in [8 , Lemma 2], relative compactness in the non-classical situation 
was established in the general case of local martingales with respect to weak 
convergence in D. However, it is not exactly what we need since we consider 
convergence in C. 

Certainly, once we consider continuous processes, and if limiting processes 
are also continuous (which is true in our case), compactness in D implies con- 
vergence in C. However, when considering piecewise linear processes like X n {t) 
we loose the martingale property even when the r.v.'s £,j„ are independent. On 
the other hand, if we switch to piecewise constant processes, we have to consider 
convergence in D, which is not enough for us. 

We believe that this is a technical obstacle and it may be somehow fixed, but 
in any case, in our opinion, a self contained (and relatively short) proof for the 
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situation of independent summands would have an intrinsic value. So, we provide 
this proof. 

Thus, we establish relative compactness of {!?„} in C under condition (11.1.31) . 
Set k jn = [ty 

tjn], where the points t j n are defined as in (11.2.11) . For 8 > 0, 
we define the process X n (t; 8 ) as a result of replacement of the r.v.'s \j n by the r.v.'s 
\j n = ^ 7 - ;i l{Oy n > 8} in the definition of X n (t). (As usual, 1{A} is the indicator of 
a condition A.) 

First, we show that for a fixed 8 > 0, the family of the distributions of X n (t;d) 
is compact. Indeed, denote by k^ jn = [rf _j> , rf nn ] the segments k,„ where the 

process X n (t; 8) is not constant. We assume that kf nn is on the left of k? +1 s . 

Since 8 > 0, the number of the segments k m „ is finite. Denote this number by 
q(n, 8). Clearly, q(n, 8) < q = [1 /8] where [a] stands for the integer part of a. It is 
convenient to think that always m = l,...,q, setting kj^ = [1,1] for m > q(n,b). 

Clearly, there exists a subsequence n = and segments k^ = [r? 
m = 1, ...,<?, such that 

k§ „ _> k§ as j ^ oo 

(that is, the corresponding endpoints of the segments converge). 

On the other hand, for each kj^ n , the distribution of the increment X„(r^„; 8) — 
^ n ( r fm-i)n' ^) * s ec l ua ^ *° a distribution F/„ for some j. Then from the main condi- 
tion (11.1.3b it follows that the distribution of X^(r^~; 8) — JGi(r^ n _ 1 ^~; 8) weakly 
converges to the normal distribution with zero mean and the variance equal to the 
length of k^. (We skip a formal proof of this fact. Because ( 11.1.3b is true for any 
£ > 0, we have convergence in the corresponding integral metric on any segments 
[e,°°) and (— oo, — e]. This implies weak convergence. Since the limiting distribu- 
tion is continuous, we have as a matter of fact uniform convergence, but we do not 
need it.) 

Since the distribution of the process X^(-) is uniquely specified by the finite di- 
mensional distribution of the increments on the segments kj^, we finally conclude 
that the distribution of X~ ( • ; 8) weakly converges to the distribution of a contin- 
uous piecewise linear Gaussian process W(t; 8) having points of growth only in 
the segments kjj, and such that the increments W(r^; 8) — W(r^ m i y 8) are normal 

with zero mean and variance r^ — rf . v 

m [m— 1) 

Now, we proceed to a direct proof of compactness. Consider a sequence of 
positive numbers 8„ — > 0. As was shown, there exists a subsequence n^ 1 ) = {ft,- } 
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such that 

X(i)(.;8i)4w(-;8i)as/->oo, 

where 4> stands for weak convergence of the corresponding distributions, and 
(■) is a Gaussian process of the type V7 8 (-) described above. 
Similarly, we can choose a subsequence n^ 2 ) of the sequence n^ 1 ) such that 

x n m ( • ; 82) 4- W( ■ ; 82) as / -> 00, 

where W §2 (-) is a Gaussian process with the same properties as above. Continuing 
to reason in the same fashion, we come to a nested sequence of subsequences 
n (i) D n (2) ^ ... such that for all* = 1,2,..., 

X (*)(•; 8jt) 4>W(-;8fc) as i^oo. 

Next, consider the sequence of the Gaussian processes • ; 81), W(- ; 82), ...}. 
By the result of Section I27T1 there exists a subsequence m ; such that 

w(-,b mj )Aw(-), 

where W(-) is a Gaussian process. 

Now, we censor the sequence D D ... , choosing only n*™ 1 ) D n*™ 2 ) I) 
... .By construction, we can choose a sequence hi,«2, ••• sucri mat 

«!GnW, « 2 enW ) ...,n i 6nW, ... 

and 

Z n .(-;8 mi ) 4w(-) as 1 ->«>. 

At the last step of the proof, we set Z„(f ; 8) = X n (t) —X n [t; 8), and consider the 
sequence of the processes Ufa) = Z ni (t; 8 W; ). Each process Ufa) is a continuous 
process that is linear on each segment k/ n; and such that the variance of the incre- 
ment of the process on each kj nj does not exceed 8 mi . Since 8 ra , — > as i — > °°, 
all increments are asymptotically negligible. Formally, the processes {Ufa)} are 
not exactly of the type appearing in the classical invariance principle since for 
a finite number of segments k (with appropriate indices), the increments equals 
zero rather than having a variance equal the length of k. Nevertheless, the proof 
of compactness may run exactly as, e.g., in the classical proof from Prokhorov's 
paper ifTTl Section 3.1]. 

Thus, the sequence of the distributions of Ufa) is compact, and so does the se- 
quence of the distributions of X n .(- ; 8 m; ). It remains to observe that the processes 
Ufa) and X n fa ; 8 m J are independent. 
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2.3 Proof of Theorem g] 



2.3.1 Necessity 

Let 

Pn-Qn^O (2.3.1) 

weakly in C[0, 1]. As was shown in Section I2TT1 the sequence {Qn} is compact. 
Then {?P n } is compact either. 

Now, since Y,j0 2 j n = 1, the marginal distribution function for Y n , i.e., P(Y n < 
x) = <i>(x). Hence, in view of (12.3.11) . 

P(S n <x)^<P(x). 

By virtue of Proposition [T] this implies the validity of (11.1.31) . 

2.3.2 Sufficiency 

Assume that condition (|1.1.3I) holds. Then, as was proved above, both sequences, 
{fp„} and {Qn}, are compact. Hence, it suffices to establish the convergence of 
the differences of all finite-dimensional marginal distributions. 

Let ti <t 2 < ... < t k be points in [0,1]. Set X n (t h ...,t k ) = (X n (t { ), ...,X n (t k )) 
and Y n (ti,...,t k ) = (Y n (ti),...,Y n (t k )) and denote by P n (t u ...,t k ) and Q n (h,...,t k ) 
the distributions of the random vectors X n [t\ , . . . , t k ) and Y n (ti , . . . , t k ) , respectively. 
Both sequences, {P n (h, ...,t k ) and Q n (h, ...,t k ), are compact. 

We should prove that 

P n (t h ...,t k )-Q n (t h ...,t k )^0. (2.3.2) 

Let the half interval r(j,n) = [t(j-i) n ,tjn), an d the relations ti E r(m in ,n), / = 
1 . , , , .k, define the integers m in . Then for i = 1 , . . . , k, 

X n (ti) = S( m . n _i) w H — 2 ' U»n (2.3.3) 

Yniti) = Z {min _ l> + t ^^r [mintl . (2.3 A) 
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For each n, consider the random vectors 

'm\ n -\ m 2n -l 
7=1 7=«1h+1 



W£„-l 

£ £>7«> ^nl(mfe>m fc _i) ], (2.3.5) 

j= m (k-l)n+ l 



and 



£ Tlyn, T]m 2 nl(m 2 > mi), ... 

7=1 7= m ln+l 

£ T\jn, r\m k nHm k > m k _\) ] (2.3.6) 

j= m (k- L)n+1 

where, by convention, £^ = for a > 

Vectors (|2.3.5I) and (|2.3.6I) are those with independent coordinates and are 
of the fixed dimension 2k. Denote the jth coordinates of these vectors by V I / j n , 
and T jn , respectively, and set = ( v I / i„, ..., x ¥ 2k ,n), = (T ln , ...,Y 2 k,n)- Let the 
symbol (Px denote the distribution of a r.v. or a random vector X. 

First, note that the families of the distributions {(P «p n } and {2> y } are compact. 
Second, by results of [TT5l - [[T6l mentioned in the Introduction, condition (11.1.31) 
implies that 

jeB„ j€B„ 

weakly for any sequence {B n } of sets of indices. In particular, this means that 

weakly for each j. Since the coordinates of the vectors *P n and T ;1 are independent, 
this implies that 

On the other hand, in view of (I2.3.3I ) and (I2.3.4I ), each r.v. X n (ti) is a linear com- 
bination of the r.v.'s ^jn, and each r.v. Y n (ti) is the linear combination of the r.v.'s 
Tj n with the same coefficients as for X n {ti). Together with the compactness of 
V «p n and <£ Tn , this leads to (I2.3.2I) . ■ 

Since the sequence of the distributions {Q«} is compact, the proof of Proposi- 
tion |3]is straightforward, and we skip it. 
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